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A promising way to introduce general relativity in the classroom is to study the physical impli¬ 
cations of certain given metrics, such as the Schwarzschild one. This involves lower mathematical 
expenditure than an approach focusing on differential geometry in its full glory and permits to 
emphasize physical aspects before attacking the field equations. Even so, in terms of motivation, 
lacking justification of the metric employed may pose an obstacle. The paper discusses how to estab¬ 
lish the weak-field limit of the Schwarzschild metric with a minimum of relatively simple physical 
assumptions, avoiding the field equations but admitting the determination of a single parameter 
from experiment. An attractive experimental candidate is the measurement of the perihelion pre¬ 
cession of Mercury, because the result was already known before the completion of general relativity. 
It is shown how to determine the temporal and radial coefficients of the Schwarzschild metric to 
sufficiently high accuracy to obtain quantitative predictions for all the remaining classical tests of 
general relativity. 
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I. INTRODUCTION 

Conceptually speaking, general relativity (GR) is not 
a particularly difficult theory. From the viewpoint of 
physics education, all the conceptual impositions of the 
relativity theories arguably arise with special relativity 
(SR) already. It is in courses of SR that students will 
be exposed to the relativity of simultaneity and a va¬ 
riety of paradoxes, having to do with time dilation and 
differential agingjl^ length contraction and the pole-barn 
paradox;^ bridges that may or may not collapse under rel¬ 
ativistic trains?^ relativistic lever experiments appearing 
to violate angular momentum conservation;^ Bell’s space¬ 
ship paradox;^ Ehrenfest’s paradox^^— and the appear¬ 
ance of non-Euclidean geometry in accelerating systems 
such as a rotating diskjiSri^ Those students will probably 
not be overly shocked by the additional complication of 
spacetime curvature in GR. 

And this is essentially the only conceptual complica¬ 
tion. Some things even get simpler with certain stan¬ 
dard examples of GR systems. In SR, we have the be¬ 
wildering phenomenon of mutual time dilation, utterly 
incomprehensible without an understanding of the non¬ 
absoluteness of simultaneity. When comparing coordi¬ 
nate stationary observers (GSOs) in a static metric, there 
usually also is time dilation, but different observers agree 
on whose clocks are runnig faster, a situation that is not 
especially difficult to visualize. 

What makes GR difficult, is the mathematical over¬ 
head. SR can be taught with very little calculus, whereas 
in GR, differential geometry is essential. The field equa¬ 
tions of GR are intrinsically nonlinear, so their solution 
is, even in the simplest cases, nontrivial. The Riemann 
curvature tensor has 20 independent components. 

Given the conceptual simplicity and the mathematical 
complexity of GR, it is natural to ask whether it is pos¬ 
sible to find a simpler approach to certain fundamental 
aspects of the theory, to make it more accessible to stu¬ 
dents in the transition from special to general relativity. 


A full-fledged course in GR will have to deal with the 
field equations eventually, but the entry point into the 
theory might be based on much simpler considerations. 
An SR course giving a glimpse at GR near its end may 
benefit from avoiding the field equations altogether. 

It is a substantiated view that exploring the con¬ 
sequences of a particular metric (normally the Schwarz¬ 
schild one) leads to an accessible “physics first” approach 
to introducing GR j^^d^ Unfortunately, the metric will 
arise out of the blue in such a strategy. Therefore, it is 
legitimate to inquire whether we can do better and ob¬ 
tain nontrivial metrics from simple arguments, without 
going all the way to the field equations. 

Such an idea was implicit in the so-called Lenz-Schiff 
argument, apparently never published by Lenz but pre¬ 
sented in Sommerfeld’s textbooki^ and used by Schiffi^ 
to argue that light deflection by the sun is quantitatively 
describable without the field equations. This would put 
it on a par with the gravitational redshift of spectral lines 
in the field of a weakly gravitating object such as our sun, 
known to be explicable by a combination of special rel¬ 
ativity with the Newtonian limit (NL), using Einstein’s 
equivalence principle (EP). The perihelion precession of 
Mercury would then remain the only one of the three 
classical tests of GR that really probes the field equa¬ 
tions. 

Schiff’s paper was shown to be in errorj^iiS Neverthe¬ 
less, recurrentlyiS and even recently^ articles have been 
published that “derive” GR effects requiring spacetime 
curvature on the basis of the fallacious Lenz-Schiff argu¬ 
ment. Yet, detailed arguments had been given^iiS^ that 
a simple derivation of the Schwarzschild metric, i.e., one 
avoiding knowledge that traditionally is gathered from 
the held equations, is impossible. 

The appearance of a controversy may be deceiving. 
Advocates of the Lenz-Schiff argument seem to be un¬ 
aware of its dehciencies. In contrast, anyone familiar with 
the foundations of GR will realize that any nonsingular 
metric (with Minkowskian signature) is locally compati- 
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ble with SR, due to the equivalence principle (stating that 
it always possible to transform the metric to Minkowski 
form locally). Therefore, the EP does not constrain the 
metric. Without the field equations or some equivalent, 
constraints on the metric arise only from symmetry and 
the NL. Any corrections to the NL, expressible in powers 
of the small quantity GM/c^r for a spherically symmet¬ 
ric situation^ must be missed by a Lenz-Schiff type ap¬ 
proach as, in fact, by any other approach based on local 
considerations within the framework of SR. Therefore, 
no more than a weak-field approximation to the metric 
can be gained. This does not yet exclude Schiff’s result, 
essentially referring to the first-order term in powers of 
GM/c^r of the radial metric coefficient grr- However, the 
aforementioned analysea^is^^ show that the Newtonian 
limit gives us only g^r = 1. Thereafter, any claims to de¬ 
riving the Schwarzschild metric or even a post-Newtonian 
approximation to it on the basis of just symmetry, SR, 
the EP, and the NL, are recognizably erroneous. 

To be precise, this does not mean that one cannot do 
without the field equations. 

Essentially, either the field equations or their generat¬ 
ing action including its Einstein-Hilbert part are a set of 
postulates within GR. In most theories based on postu¬ 
lates or axioms, the axioms are not unique. In set theory, 
for example, the axiom of choice, Zorn’s Lemma and the 
well-ordering theorem are all interchangeable!^ It is suf¬ 
ficient to postulate one of them. The other two are then 
derivable. In thermodynamics, there are various different 
formulations of the second law, which is a postulate of 
the theory. It is sufficient to take one of them, then the 
others can be derived as theorems!^ 

If consideration is restricted to the static spherically 
symmetric case, it may be possible to use a simpler pos¬ 
tulate (or two) than the one leading to the field equations 
to derive a metric. Due to the restriction to spherical ge¬ 
ometry, there is no need that the postulate be powerful 
enough to replace the field equations altogether. It is 
sufficient, if it can replace them in spherically symmet¬ 
ric situations!^ This kind of approach is not only log¬ 
ically possible, it has even been discussed favorably by 
Sacks and BalU^ with regard to Tangherlini’s postula- 
tional approach to the Schwarzschild metric!^! Unfortu¬ 
nately, Rindler later showed one of the two Tangherlini 
postulates to be unconvincing!^ But clearly, Tangher¬ 
lini’s approach is not subject to the criticism (nor the 
impossibility proof) offered by Gruber et al^ 

Since both postulates from Ref. [13 cannot be used 
here, the exact Schwarzschild metric will not be obtained. 
However, I use one argument beyond the aforementioned 
ingredients (symmetry, SR, EP, NL) to restrict the form 
of the metric. This will reduce to the more convinicing 
one of Tangherlini’s two postulates, and it will be better 
justified than his statement. On the other hand, addi¬ 
tional information will be needed to obtain a truly post- 
Newtonian approximation, and this can be taken from 
experiment. Amusingly, all of this information was avail¬ 
able in 1911, when Einstein published a calculation of 
light deflection by the sun^ reproducing von Soldner’s 
century-old result^ and thus missing the correct predic¬ 
tion by a factor of 2. 


The general outline of the paper is as follows. In Sec.HIl 
the metric describing the closest approximation to a uni¬ 
form gravitational field that is relativistically possible, 
also known as the Rindler metric, is derived. This sim¬ 
ple problem, rigorously solvable within special relativity, 
serves to expose the interplay of symmetry arguments 
and thought experiments allowing us to obtain metric 
coefficients, an approach that then may be applied to 
more complex situations. It also demonstrates a little- 
appreciated property of the Rindler metric justifying its 
interpretation as describing a uniform gravitational field. 
Section IhII is devoted to an introductory attempt at con¬ 
structing the metric of a spherical mass distribution us¬ 
ing symmetry, the EP and the NL. At first sight, this 
approach succeeds in obtaining the exact Schwarzschild 
metric. Its deficiencies become visible on analysis of the 
order of approximation achieved. A plausible and sim¬ 
ple physical assumption turns out to partially cure the 
problem. In Sec. El it will be shown how information 
from a true experiment?^ viz. measurement of the per¬ 
ihelion precession of Mercury, may then be used to re¬ 
solve the issue and to obtain the weak-field limit of the 
Schwarzschild metric with sufficient accuracy to quanti¬ 
tatively predict light deflection by the sun, as demon¬ 
strated in Sec. El and the Shapiro delay!^ That section 
also discusses how the aforementioned factor of 2 can be 
found without actually performing the full calculation. 
Einally, some conclusions are given in Sec. IVII 

Most of the material is presented as if we did not know 
GR yet, but occasionally this stratagem is dropped to 
address teachers directly, who are assumed to be more 
knowledgeable. We shall assume general acquaintance 
with SR and the use of different coordinate representa¬ 
tions of the Minkowski metric. 


II. DERIVATION OF THE METRIC OF A 
“UNIFORM” GRAVITATIONAL FIELD 

Our first aim is to deal with the inertial field inside an 
accelerating object, say a big spacecraft. This situation 
may be completely described within SR. At each point, 
the acceleration is to be constant in time, i.e., each ob¬ 
server feels a constant proper acceleration. In Newtonian 
physics, we would get something resembling a uniform 
gravitational field, if all observers had the same acceler¬ 
ation. In relativistic physics, we know that if observers 
arranged along the direction of acceleration had the same 
proper acceleration. Bell’s spaceship paradox^ would 
apply - they would find each other moving apart. Rather, 
we are interested in a situation that is considered static 
by all observers. This is Born rigid motion?^ in which 
the proper distances between our aligned observers re¬ 
main constant, which means that from the vantage point 
of an inertial system leading observers must accelerate 
more slowly than trailing ones, so their distance shrinks 
precisely according to the appropriate Lorentz factor. 

Let us introduce an inertial frame E with time T and 
cartesian coordinates X, Y, Z and have its X axis ori¬ 
ented parallel to the direction of acceleration. Consider 
first a single (point-like) observer O. Since his proper ac- 
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celeration is constant, he will feel a constant accelerating 
force /o, which is also the force, by which a momentar¬ 
ily coinoving inertial observer (in frame S') will perceive 
O to be accelerated. Moreover, the relativistic transfor¬ 
mation law for forces parallel to the vector of relative 
motion between inertial systems tells us that the force 
Fq, by which O is accelerated in E is the same as in E': 
Fq = fo- Then the equation of motion for O’s trajectory 
in E reads 

-^m'y(y)V = fo , (1) 

where V{T) = dX/dT and 7(1^) = l/y^l — jc^. If we 
set the time T equal to zero at the moment when V = 0^ 
this is solved by 


setting X = X 2 and choosing T = T 2 so that t = 0. This 
gives 

T 2 = Ti + \{X2 - Xi) , 

D = X2 = {X2-X^)h ^ T2=Ti + ^^^^D. (7) 

Requiring in addition that the velocity of O 2 with respect 
to Oi be zero, we find ^\x^x 2 ~ ^ (obviously). This 
implies oiTi = 02 X 2 , which together with ([2]) and © 
leads to 01 X 1/02 = Xi (l -I- oiX/c^). We then obtain the 
important relationship 

D = -= X 02 - Xoi . (8) 

02 Oi 


j{V)V = aT, (2) 

where a = /q/to is the proper acceleration of O. Solving 
for V, we have 



and this can be integrated once more to obtain 



The trajectory X(T) is a hyperbola rather than the 
parabola known from Newtonian physics, hence the no¬ 
tion of hyperbolic motion. Note that by taking the time 
derivative in ([3]), we end up with the standard relation¬ 
ship for the transformation of longitudinal acceleration: 
A = d^X/dX^ = a/j{V)^ . Next, consider two observers 
Oi and O 2 , starting at Xqi and X 02 with proper accel¬ 
erations ai and 02 , so their trajectories are given by 



and require the distance between them to remain con¬ 
stant in the frame of the firsti^ The Lorentz transforma¬ 
tions from E to an inertial frame momentarily comoving 
with Oi read 


a: = 7(X-Xi-R(X-Xi)) , y = ^ 1 + , 

t = 7(x-Xi-^(X-Xi)^ (6) 

and they transform the point (Xi, Xi) to the origin of the 
comoving inertial observer, in whose frame the distance 
D between Oi and O 2 at this moment is obtained by 


Thus, by choice of the origin of E, we may achieve 
Xoi = I o-i for observer Oi where originally i = 1,2, 
but evidently, this can be extended to an arbitrary num¬ 
ber of observers. Equation dH) for the trajectory of an 
observer starting from Xp = x then simplifies to 

X = \/x2 -h c2X2 . (9) 

If we fill a half-space with observers labeled by their pos¬ 
itive initial coordinate x and have them move according 
to (IH) with their Y and Z coordinates unchanged, the 
ensemble will perform Born rigid motion. 

To obtain the metric describing the common rest frame 
of these observers, we note that translational symmetry 
in the y and 2 ; directions as well as the requirement of 
time independence of the metric imply the following gen¬ 
eral form of the spacetime line element!^ 

ds^ = -F(i) c^dt^ F G{x) dx^ + dy^ F dz^ . (10) 


Here, we have temporarily garnished one coordinate with 
a tilde, because for G(x) ^ 1, the proper length ele¬ 
ment d^ of this metric (d£^ = ds^|dt=o, due to time- 
orthogonality) does not have its standard form, whereas 
our relationship for the proper acceleration derived above 
was formulated in terms of the proper distance. The sim¬ 
ple coordinate transformation 


x(x) 



( 11 ) 


turns m into 

ds2 = -f(x) c^df F dx'^ F dy"^ F dz^ , (12) 

so all that remains to be determined is the function f{x). 

The proper time of a CSO, i.e., an observer satisfying 
dx = dy = dz = 0, is given by (ds^ = —c^dr^) 

dT = a/ f{x) dt , (13) 

so observers at positions xi and X 2 will find their stan¬ 
dard clocks be subject to time dilation according to 


dT2 ^ j f{x2) 

dn y f{xi) ■ 


( 14 ) 
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To determine the time dilation factor, imagine that Oi 
sends an electromagnetic signal having the frequency vi 
to the very close O 2 , who will receive it at frequency 
r' 2 . A good inertial frame to discuss this in is the frame 
momentarily comoving with Oi at the emission event. 
During the short time interval Ar = [x 2 — xi)lc taken 
by the light, O 2 will have picked up the small velocity 
V = a{x 2 )^T in this inertial frame. If X 2 > xi, O 2 is 
moving away from the emission event (since both the 
signal and O 2 are moving towards increasing x), so the 
frequency 1^2 on reception of the signal will be lower than 
1 ^ 1 , due to the Doppler effect. The relativistic Doppler 
shift is given by 


vi jc + v V a{x2)AT a(x2)Ax 

— = \ - ~ i H- ^ I -\ -1 H-- . 

V 2 \ C — V C C 

(15) 

We introduce an acceleration potential via d4> = a(x) dx. 
Then (ITTl) and (ITSl) imply (dT 2 /dTi = V 1 IV 2 ), for suffi¬ 
ciently small X 2 — xi'. 


f{x2) ^ , d>(x2) - $(xi) 

f{xi) 


(16) 


which can be easily converted into a differential equation 
for /(x). 


I f{x + dx) - /(x) 

fix) 


= 1-1- —$'(x) dx , 


2 fix) c2 


(17) 


and this is solved by 


fix) = , (18) 

where the integration constant can be chosen by fixing an 
additive constant implicit in the definition of $(x). Since 
we know the position dependence of the proper accelera¬ 
tion a(x), it is straightforward to obtain the potential: 

dm 9 

— = a(x) = — => m(x) = In X-|-const. (19) 

dx X 

This gives 


fix) = , (20) 

where g is the proper acceleration at the position xq, 
where /(xq) = 1. 

Even though we have now successfully derived the 
Rindler metric 

d^2 ^ - + dy^ + dz^ , (21) 

it is useful to look at another thought experiment. Con¬ 
sider an observer at Xq who slowly lowers some small 


mass m, hanging from an inextensible massless tether, 
towards smaller x values. 

Of course, in relativity, there are no truly inextensible 
bodies, because they would allow infinitely fast signaling 
(pull at one end to immediately transfer a message to the 
other). However, we do not need more than approximate 
inextensibility. An inextensible tether is simply one with 
a very large Young’s modulus Y. The larger we make Y, 
the better the approximation. The kind of inextensibility 
we want here may be expressed in terms of the covariant 
formulation of Hooke’s lawi^ All length changes required 
by relativistic kinematics are allowed, but local proper 
length increments of the tether do not change under a 
force^i The only reason for this requirement is to avoid 
the consideration of elastic or plastic effects. Since we use 
our tether for quasistatic transport only, fast signaling 
will not occur.— 

What is important in the following is that if a piece d£ 
of the tether is threaded down at its upper end, the lower 
end will move down by the same amount d£ in terms of its 
local proper length. The question we ask then is: what is 
the force needed to hold the mass at position x? At the 
beginning of the process, i.e., at xq, we clearly expect the 
force to be A = —mg, but as the mass is lowered, it will 
experience different local proper accelerations. A way to 
calculate the force is to invoke energy conservation. On 
being lowered, the mass is doing work, so we should have 

79 d£;(x) d£;(x) 

where E(x) is its energy at position x, as judged by the 
observer at xq. Now locally, the mass always has en¬ 
ergy mc^, as it does not acquire kinetic energy - the ex¬ 
periment is performed quasistatically. But the observer 
at Xq will not assign this local value to energy, because 
to him everything at x happens at a slower rate due to 
time dilation. This reduces the energy of photons by 
the time dilation factor. Clearly, all other energies must 
be affected the same way, otherwise no consistent phys¬ 
ical description would be possible. To see this in more 
detail, imagine that the energy of massive particles is re¬ 
duced by a factor /3 < 1, that of photons only by a factor 
a > p. Suppose the local observer has an electron and 
a positron annihilate to produce two photons, the en¬ 
ergy of which locally is given by the sum of the particle 
energies: 2hi' = + Ef,+. For the distant observer, en¬ 

ergy conservation would be violated, because the energy 
of the two photons would be 2ahv, that of the particles 
PiEe + Ae+), and 2ahv > l3{Ee + Ef,+ ). By an appropri¬ 
ate procedure, with the lower observer sending photons 
to the upper one, who converts them into particles that 
he sends down, where they are converted into photons 
again, a perpetuum mobile (of the first kind!) could be 
built,^ 

A perpetuum mobile could of course also be con¬ 
structed, if a < /3, using the reversed sequence of pro¬ 
cesses. 

From these considerations we conclude 


Eix) 



V'/(x) mc^ , 


(23) 
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(because f{xo) = 1), hence 

which evaluates to -F = —mg. Therefore, the force ex¬ 
erted by a mass hanging from a tether is constant for 
a given observer;^ no matter by how much it is low¬ 
ered in the “inertial field”, a fact that has been noted by 
Gr0n before4i This is the meaning of “uniform” when we 
are talking about the uniform gravitational field — homo¬ 
geneity of the force on a particle or an object in a fixed 
observer’s frame rather than homogeneity of acceleration 
(i.e., homogeneity of the force per unit mass). A detailed 
discussion of the issue of uniformity of fields in GR is 
given in Ref. IH. 

Note that we could have derived the metric by requir¬ 
ing the tether force to be constant and using This 

derivation would be less rigorous than the one actually 
given but it would be physically well motivated and the 
result would be valid. 

The Rindler metric is related to the Minkowski metric 
by a coordinate transformation 

cT = X sinh —t, X = x cosh —t, Y = y , Z = z , 
c c 

(25) 

so it describes a flat spacetime, the curvature of which 
necessarily vanishes. In modern parlance, gravity is of¬ 
ten identified with the curvature of spacetime, but Ein¬ 
stein’s view rather was that inertial and gravitational 
fields are identical in nature. Moreover, while the equiv¬ 
alence principle declares this identity only locally for in¬ 
homogeneous gravitational fields, it is not inconceivable 
that a mass distribution (homogeneous in y and z) could 
be constructed theoretically that would produce the met¬ 
ric (ED) in the vacuum delimiting it above some x value. 
Would we then refuse to call the corresponding attractive 
field gravitational just because spacetime happens not to 
be curved? 


III. THE METRIC OUTSIDE A SPHERICAL 
MASS DISTRIBUTION 

Next, we would like to extend the ideas developed so 
far to a nontrivial gravitational field, one that cannot be 
obtained by a simple coordinate transformation from the 
Minkowski metric. 

A leading theme in general relativity is to explain grav¬ 
ity in terms of spacetime geometry; in particular, the 
notion of curvature of spacetime becomes important. A 
basic object in describing spacetime geometry is the met¬ 
ric. There are more complex objects, derivable from the 
metric (such as the Riemann curvature tensor), that may 
be used to decide whether the metric describes a flat or 
a curved spacetime. None of these objects will be needed 
here. It may be sufhcient to say that if we write done 
some metric randomly, it is much more likely that it will 
describe a curved spacetime than not. A condition for 


flatness is that there exists a global coordinate transfor¬ 
mation that takes our metric to Minkowski form. Since 
this is true only when certain integrability conditions are 
met (that are expressible via the Riemann tensor), most 
metrics that we may care to write down will comprise 
curvature and this is related to the presence of gravita¬ 
tion according to GR. 

One of the simplest gravitating systems is a time-inde¬ 
pendent spherically symmetric mass distribution. We ex¬ 
pect it to be describable by a static spherically symmetric 
metric. As we shall convince ourselves presently, the line 
element may then, without loss of generality, be written 
as 

ds^ = -/(r) g{r) dr^ -|- (di?^ -|- sin^ d dgf^) . 

(26) 

Here, d and are the usual angular coordinates which, 
due to spherical symmetry, may only appear in the com¬ 
bination dn^ = di?^ -I- sin^ d dfi^ but not in any of the 
coefficient functions. Because the metric is assumed time 
independent, none of the coefficients may depend on t. 
Therefore, all of them may be functions of the radial co¬ 
ordinate r only. The prefactor of dH^ might contain some 
additional function of r, which we can however get rid of 
by redefining r so that the surface of any sphere about 
the coordinate center, given by r = const., becomes dirr^. 
Finally, a term of the form hir) dt dr would be allowed 
by symmetry, but can be removed by a coordinate trans¬ 
formation t —>■ t -I- w(r).— 

At large radii, gravitation will become negligible, so 
the metric should approach Minkowskian form, hence we 
require 

lim /(r) = I , lim g{r) = I . (27) 

r—)-cxD r—^■oo 

So far, we have not used any physics, just symmetry. 
In order to determine /(r) and g{r), we need to invoke 
physical ideas. 

First, we make use of the equivalence principle. In¬ 
stead of translating the physics in an accelerating system 
into terms of a gravitating one, which requires to visual¬ 
ize two different but equivalent systems in parallel, let us 
consider a freely falling observer in the actual system un¬ 
der consideration!^ The prescription then is to describe 
local physics in the frame of that inertial observer by SR. 
For the freely falling observer, there is no gravitational 
field and everything that the gravitational field does to 
CSOs must be due to the fact that they are accelerating 
with respect to his inertial frame. Note that an infinity 
of freely falling observers may be chosen at any point. 
Normally, the best choice is to consider one that is mo¬ 
mentarily at rest with respect to the object (e.g., a CSO) 
that is to be described. 

Using the EP, we obtain a relationship between the two 
functions to be determined and the local gravitational 
acceleration. Gonsider two very close GSOs A at ri and 
B at (the same angular position at) r 2 with r 2 > ri, plus 
an inertial observer C momentarily comoving with A, the 
moment A sends a light signal to B, as depicted in Fig.[D 

In C’s frame, the frequency vi of the signal is un¬ 
changed during its short transit time At. But B, having 
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FIG. 1. Equivalence principle: The inertial observer C arrives 
at the apex of his trajectory next to A the moment A sends 
off a signal to B. 


accelerated to a small velocity v = a(r 2 )Ar, will receive 
it at a reduced Doppler shifted frequency V 2 . The de¬ 
viation of the ratio of frequencies from 1 is attributed 
to time dilation by A and S, who are stationary. From 
ds^ = —c^dr^, we read off that the proper time of a CSO 
is given by dr = y/f{r) dt, so the frequency ratio may be 
calculated as 


^ ^ _ Vf{r2) 

^2 dn y/fin) ■ 


For C, the special relativistic Doppler shift formula yields 


1^1 c+v , V 

— = \ -1 H— . 

1^2 \ C — V C 


(29) 


where the integration constant has been fixed by the 
boundary condition (1271), given that, for large r, $ must 
reduce to the Newtonian potential, i.e., go to zero in the 
standard gauge. Since we do not know anything about 
$(r) for small r, Eq. (1341) means no more in that r range 
than expressing one unknown function, f(r), in terms of 
another equally unknown one, $(r). All we achieve by 
this is to equate /(r) to a quantity that has the physical 
interpretation of a potential. Nevertheless, the result is 
useful, as it is this interpretation that allows us to deduce 
the functional form of /(r) at large r by the requirement 
that the potential become Newtonian there. 

To obtain a second relationship for the two functions 
/(r) and g{r), let us look at the same thought experi¬ 
ment as in the case of a uniform field. Assume that an 
observer at tq slowly lowers some mass m hanging from 
an inextensible massless tether towards smaller r values. 

As before, we express the force using energy conserva¬ 
tion. 


dE{r) dE{r) dr 

d£ dr di ’ 


(35) 


where E{r) is the energy of the mass at position r, as 
judged by the observer at tq. By the same kind of argu¬ 
ment as in the uniform gravitational field we now obtain 


E{r) 



-)• 

ro^oo 


V/(r) mc^ , 


(36) 


Denote by A£ the local proper distance separating A 
and B. The metric is time orthogonal, therefore d£^ = 
ds^|dt=o- We then have, for the transit time, Ar = 
M/c = y/g{r) {r 2 — ri)/c (with r G [ri,r' 2 ]). Due to the 
closeness of the three observers and the smallness of their 
relative motion, the length A^ is the same for all of them 
during the sequence of events considered. It is again use¬ 
ful to introduce an acceleration potential, descri bing lo¬ 
cal proper acceleration, via d$ = a(r) d£ = a{r)y/g{r)dr. 
This provides 


c 


aAi _ A$ 

2 2 ~ ’ 


(30) 


where for simplicity our observer was moved to infinity. 
The force felt at the upper end of the tether then is 


77! 2 

r = —me 


1 

y/^ 


fir) 


2 \//( 0 ffW 


(37) 


Now we require that for large r, $(r) and F(r) take their 
Newtonian limits, i.e.. 


$( 7 -) 

F(r) 


GM 


GmM 


^2 


(38) 

(39) 


and, for sufficiently small |r 2 — ri\ 


fir2) ^ 4>(r2) - d»(ri) 

/(d) 


a(r) 


1 d$ 


(31) 

(32) 


The first of these two equations is converted into a dif¬ 
ferential equation as before 


At this moment, we have no idea about what will become 
of these laws as relativistic effects become strong, so we 
content ourselves with determining a weak-field limit of 
the metric. The relevant quantity distinguishing between 
weak and strong is <I>(r)/c^, which outside the sun does 
not exceed 10 “^ in our solar system, so this limit should 
be appropriate for all calculations referring to the latter. 

Equations (IMl) and (1571) together with expressions (1551) 
and (15^ for the potential and force may be used to de¬ 
termine the two functions /(r) and 5 ( 7 ’): 


1 fir) 

2 fir) 

and this is solved by 

fir) = e 


1 , 

(33) 

fir) 

= e^^/' 

— r 



GmM 

( 

= —m- 



J.2 



^ gf) 

1 

,2$/c= ^ 

(34) 

fir) 


$ 


2GM 


GM 


1 


\/fir)gir) 

1 


1 - 


2GM 


(40) 

(41) 
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This gives us, as a weak-field approximation, the line 
element 


ds^ = - 1 - 


2GM" 




1 - 


2GM 


dr^ 


(dz?2 


• Sin 




(42) 


which is the exact result for the Schwarzschild metric! 

Of course, this is too good to be true. To see what hap¬ 
pened, let us use, instead of the last formula from (1571) . 
the first. This means, we approximate f{r) instead of 
/(r) itself. Then the calculation reads 


— p-*>/c 


V/W = e 

GmM GM 

-- TT- 


, ^ GM 

^ ~2 ~ ^ - 2 " 

c rc 

1 




9{r) = 1 , 


(43) 


which is not the expected result for g(r). 

In the two calculations, we used approximations for 
/(r) which agreed to first order in the small quantity 
GM/r(? (ft! 4)/c^), but not to second order. However, the 
structure of the equations is such that the first-order term 
of g{r) depends on the second-order term of f{r). This 
becomes immediately clear, if we plug the approximation 


f{r) = 1 - 


2GM 

rc^ 


+ P 2 



(44) 


into dsa with the force law ®. No matter whether 
we take the formula with the derivative of /(r) or that 
with the derivative of its square root, we obtain the same 
first-order result for g{r), providing we expand the square 
root correctly to second order: 

g{r) = 1 + 2{1-(3,)^. (45) 

rc 

Our first approximation corresponds to /32 = 0, the sec¬ 
ond to 132 = 1. 

Therefore, to obtain a nontrivial result for g{r), we 
need to know f{r) or, since the relationship (l34l) between 
/(r) and the potential is exact^ the potential $(r) at 
least to the next order in GM/rc?. Knowing the exact 
result for f{r) from GR, we may infer that Eq. (1551) is 
indeed only a lowest-oder approximation.— 

This immediately begs the question whether this is 
true for (|39l) as well. Indeed, if we had to replace the 
force law by Fir) = —GmM/r^ (l -\- ^iGM/rc? -|- ...), 
this would bring in another unknown coefficient and di¬ 
minish our chances of calculating anything meaningful. 

There are two reasons to believe - without prior knowl¬ 
edge of the exact result - that (|3^ is, in fact, exact. The 
first is that a similar thing happened in the case of the 
Rindler metric. A force measured with tethers as de¬ 
scribed turns out to be constant in space just as in a 
Newtonian uniform gravitational field. That the Newto¬ 
nian force law be valid beyond the weak-field approxima¬ 
tion in the spherically symmetric situation as well was 
one of the two postulates introduced by Tangherlini— 


It may be considered an extrapolation from the homoge¬ 
neous to the inhomogeneous case, hence a suggestive but 
not really a strong argument. However, there is a much 
more convincing way to justify this assumption. 

The force F{r) is a global field, measurable using 
tethers— In principle, the field may be measured in all of 
space (outside the central star). Calculating the integral 
of the force over the surface of a sphere of radius r, we ob¬ 
tain its total flux through that surface. If we evaluate it 
on two concentric shells, the integral should not change, 
if there is no source of the field between the two shells, 
i.e. in vacuum. In fact, experience with both Newtonian 
gravity and electrodynamics suggests this quantity to be 
a fixed multiple of the total “charge” enclosed by the 
shells (mass, electrical charge) that is at the origin of the 
field. This charge should be a conserved quantity - in GR 
we expect it to be related to mass-energy. Therefore, if 
such a conservation law prevails in GR, the force must 
be proportional to 1 /r^ outside the spherically symmet¬ 
ric mass distribution, if the area of the surface of a sphere 
grows as r^. But we defined our coordinate r precisely 
so that this be the case. 

Therefore, we will assume in the following that (|39)) is 
exact. This requirement goes beyond SR, the EP, and 
the Newtonian limit. It is a partial replacement of Ein¬ 
stein’s assumption that in vacuum the Ricci tensor Rik 
must be required to vanish.— In fact, it can be shown 
to follow from the vanishing of the diagonal temporal 
component Ru- Being a much weaker postulate than 
Einstein’s, it gives us much less. The latter produces the 
exact Schwarzschild solution, the former fixes one of the 
functions / and g in terms of the other. So if we knew 
the expansion of /(r) in powers of GM/rc?, we could 
fully calculate the corresponding expansion of g{r). But 
at this stage, we cannot even determine the coefficient 
?2- 

We conclude that just using the EP in trying to tran¬ 
scend SR and Newtonian gravity gives us the first-order 
term of the expansion of f{r) in powers of GM/rc"^, but 
nothing more (/(r) = 1 -|- fiGM/rc^ with = —2), 
in accord with Refs. [2l| and [l^. Our additional as¬ 
sumption about the validity of (l3^ beyond the weak- 
field limit produces a relationship between the coeffi¬ 
cients of /(r) and g{r). In particular, setting g(r) = 
1/ (l — aiGM/rc? — ...), we find 


ai = 2 (1 - 132 ) . (46) 

Essentially, our new postulate rests on the assumption 
that mass-energy is the only source of the gravitational 
field. It is not expected to hold in alternative theories 
of gravity, in which additional sources of the field are 
present. The Brans-Dicke theory, for example, has a 
scalar field leading to a variable effective gravitational 
constant. This bears some similarity to electrodynamics 
in a medium with varying dielectric coefficient, in which 
there would be apparent electrical charge distributions 
leading to a non-vanishing divergence of the electric field 
iy-E ? d). So the electrical field of a point charge in such 
a polarizable medium would not fall off as I/r^. Indeed, 
checking whether the postulate is satisfied in spherically 














( 51 ) 


symmetric solutions of the Brans-Dicke theory, we find 
that it is not, unless the scalar field is constant. 

To obtain the exact result for /(r) and g{r), we would 
need a second postulate. While it is possible to gener¬ 
ate a more plausible postulate than Tangherlini’s second 
one, and while this postulate is not subject to Rindler’s 
criticism)^ both its physical justification and its prac¬ 
tical use are somewhat more complex than that of the 
postulate about mass-energy conservation invoked so far 
(albeit still simpler than the field equations). Therefore, 
the presentation of that route to the exact Schwarzschild 
metric will be postponed to a different publication. It 
may not really be suited for the classroom at the early 
stage envisaged. Instead, we will take the point of view 
here, that if we cannot determine either ai or /32 from 
theoretical arguments, why not turn to experiments! 


IV. THE PERIHELION PRECESSION OF 
MERCURY 


As it is assumed that the class has no prior knowledge 
on GR, we first need to provide an approach to the equa¬ 
tions of motion of a particle in a given metric, i.e., the 
geodesic equations. This is done in the argument from 
Eqs. m to (l53l) . based on the EP. If the material of 
this paper is used at a later stage in a GR course and 
the geodesic equations are already known, this argument 
may be skipped and reference can be made to the stan¬ 
dard approach to the geodesic equations via the exact 
form of the Lagrangian given in (ISTl) . 

We write the line element as 


ds2 = -c^dr^ = c^dt^ + 


■dr^ 


1 — a{r) 

(dz?2+ sin2z?d<p2) (47) 


with 


g2$(r)/c= ^ ^ ^ ^ 

r 

m2 /m^' 
P{r) = 132^ + O { ^ 


M = ^ , (48) 


(49) 


M M2 


M3 


(r) =ai -h a 2 ^ + O — . 


(50) 



Herein, the first approximation uses the smallness of M/r 
and the second takes advantage of the fact that a planet 
moves slowly in comparison with the speed of light, so 
t Ri I, i.e., global and proper time are almost the same, 
and derivatives with respect to the proper time may be 
replaced by derivatives with respect to t. The last line 
finally identifies the kinetic energy T and the potential 
energy V in the Newtonian limit. The NL of L is a New¬ 
tonian Lagrangian, which suggests that L itself might be 
a relativistic Lagrangian. 

This conjecture can in fact be proven on the basis of the 
EP. What the principle tells us is that in a freely falling 
frame the local laws of motion are those of SR. Hence, in 
such a frame, obtainable by an appropriate local coordi¬ 
nate transformation from the global metric, a point mass 
moves along a straight line, which we can determine from 
local initial conditions, then transform back to obtain a 
piece of the trajectory in the global frame, which gives 
the initial conditions for the next (close-by) local frame 
to which we may transform. Gontinuing the procedure, 
we obtain a piecewise construction of the trajectory. A 
more elegant way rather than to construct pieces of the 
trajectory is to produce equations of motion in the global 
frame from those of the local frames and then find the full 
solution in the global frame directly. Now it is obvious 
that with the Minkowski line element the quantity 

L' = |«G|(;i:^+y2 + Z“_cV) (52) 

is a valid Lagrangian for special relativistic motion of a 
free particle. All coordinates are cyclic, so the equations 
of motion state that T, X, Y, Z are linear functions 
of the proper time, which means that the four-velocity 
is constant. These are the correct equations of motion 
in SR. Transforming this Lagrangian back to the global 
frame is trivial, because both ds and dr are relativistic 
invariants (as is to), so the result of the transformation 
is L of Eq. (ISTl) . Hence, we can derive the equations of 
motion in the metric from L. 

Are there solutions with d = 7r/2 = const, as in the 
Newtonian case? The equation of motion for "i? 

d dL dL d 2 o 2 • o Q -2 n 

-- :—-—=—mr V — mr sintlcosw c? =0 (53) 

dr d-d d-d dr r v ; 


M is assumed to be the mass of the sun and we treat a 
planet (Mercury) with mass to moving in its gravitational 
field. Gonsider now the quantity (an overdot signifies a 
derivative with respect to proper time) 


is obviously solved by d = ^, so we may indeed restrict 
ourselves to motion in the equatorial plane. 

Both (fi and t are cyclic coordinates leading to conser¬ 
vation laws: 


L 


m 

/ds^ 


~2 

Vdr ; 

' 2 [ 


-I- _|_ J.2 gijj2 ^ ^2 j _ _g 

y (r'^ + r'^'d'^ + r^ sin^ -d 
Y {rlYr'^d\+r'^s\rPdip\) 


2$(r)/c=^2p 

I — P — m^{r) P 

- —P — TO4>(r) 


r'^if = h = const. (54) 

e2$/c j. _ ^ (55^ 

Equation (I54|) describes conservation of the component 
of angular momentum perpendicular to the equatorial 
plane, and Eq. (1551) expresses energy conservation. 

Finally, instead of writing down the Euler-Lagrange 
equation for r (the many r dependent terms would lead 
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to a messy formula), we exploit the constancy of the La- 
grangian itself^ to obtain another integral of the motion: 

2 2i2 I ^ -2 I 2 -2 lr:r\ 

—c = —e ' c t H-TT^ + V ■ (56) 

1 — a(r) 


Herein, we may consider the term multiplied by M a 
small perturbation, first solve the equation with M set 
equal to zero and then correct the result using pertur¬ 
bation theory. The lowest-order equation is u" + u = 
GM/h'^ and it is solved by 


Using (I5H) and (I55|) . we can separate out an equation for 
the radial coordinate alone 


1 

uo = — 
ro 


GM 


(1 -I- ecos(f)) , 


(61) 


—+ ^ + f 1 - fc2e-2^/^^) = 0 . (57) 

1 — a[r) V / 

(The Newtonian limit of this equation is obtained by let¬ 
ting c —>■ oo, which implies fc —>■ 1 and leads to the famil¬ 
iar h?— 2GM/r = 0.) We are interested in the 
spatial trajectory only, i.e., the function r[tp), so we write 
r = dr/d^f> dtp/dr = r'p = r'h/r'^. It is then convenient 
to introduce the new variable u{p) = 1/r, whence u' = 
—r'jr^. Using the expansions of a(r) and exp(2<i)(r)/c^), 
multiplying the equation by W — a{r(u)y\/h? and expand¬ 
ing all terms to second order in Mu, we obtain after a 
rearrangement of terms: 

+ u^ - aiMu^ - a2M'^u* 

2 

- ^ (2Mu -f fc2(4 - 2ai - I32)M‘^u^^ 

2 

-I- ^ (l - P) (^1 -I- (2 - ai)Mu - a 2 M^u‘^'^ = 0 . 

(58) 


the well-known Newtonian result. The eccentricity e is 
one of the integration constants. Another would be the 
angle po between the semi-major axis and the x axis, 
which has been absorbed into a redefinition of the x axis, 
so the perihelion is at (^ = 0. The results for and a 
used in deriving (1591) immediately follow from (1611) . Note 
that only the term oc has to be treated within per¬ 
turbation theory. Without it, Eq. (1501) would be solvable 
exactly. Because we still would have to treat one term 
perturbatively, we might as well consider all terms mul¬ 
tiplied by M perturbations. 

Now we iterate the equation, inserting uq on the right- 
hand side, to obtain the first-order correction: 


u''+u = 


GM 

'W 


M 


G^M^ 

~hd~ 


2^1 


-I- — cos 2pj {d — 2ai 

- (i - y) ■ 


/ 

I 1 -I- -I- 2e cos p 

- j32) (1 -I- ecosv?) 

(62) 


In order to simplify this equation, we consider the sizes 
of its terms. The leading order terms are — 

2c^Mu/h? c^(l — k'^)/h?. While Mu is very small;^ 
the first term linear in u is multiplied by c^, a large fac¬ 
tor. This is the reason why we have to take into account 
the u^ term in the first parentheses. However, we may 
drop the 02 term in the first line and also the one mul¬ 
tiplied by c^(l — k‘^)/h?, because in this term the large 
factor is compensated by the small factor 1 — To 
see this, let us estimate fc, by evaluating Eq. (EH) at the 
perihelion, where r = 0, and taking the NL. The New¬ 
tonian value for the minimum distance of the planet to 
the center of motion is rmin = h? /{GM{1 -I- e)), where e 
is the eccentricity and h twice the areal velocity, referred 
to Newtonian time instead of proper time. We find 


fc" - 1 = - 


GM 


(59) 


where a = rinin/(l — e) is the semi-major axis of the 
orbital ellipse of the planet considered. The result is 
twice the orbital energy of the planet divided by mc^, a 
very small quantity indeed. 

Having justified the neglect of the 02 terms in (l58)) . 
we take the derivative with respect to p (to obtain a 
linear lowest-order equation), and get, after dividing off 
the common factor 2u' 



GM 


This is the equation of motion of a driven harmonic os¬ 
cillator with resonant terms on the right-hand side (the 
terms oc cos p). A straightforward treatment would lead 
to self-amplifying solutions, destroying the applicability 
of perturbation theory. Therefore, we use a slightly more 
sophisticated approach, the Poincare-Lindstedt method, 
in which the argument of the solution is considered a 
function of the perturbation, too. Restricting ourselves 
to the lowest-order scheme, we write u(}p) = m((1 -I- e)p), 
with e being proportional to the small parameter M. We 
then have u"+It = (l-|-e)^M"-|-'u ~ (l + 2e){("-|-ft and set¬ 
ting u = UQ+Mui,we obtain {iQ-|-2euQ-|-MM"-|-{to-l-Mui 
on the left-hand side of (15^ . Since uq = uq, the term 
multiplied by e is proportional to cosi^ and by an ap¬ 
propriate choice of e, we may cancel the secular terms 
on the right-hand side. Then ui satisfies an equation of 
the type u" -I- ui = A + B cos 2p, but we are not partic¬ 
ularly interested in solving it, as the information about 
the perihelion precession is in the modified periodicity of 
the solution, determined already by the value of e. The 
new period is P = 27r/(l -|- e) Ri 27r(l — e), hence the 
perihelion shift AP per period is 

AP = —27re = ttM^—^ {ai — 1^2 + d) , (63) 

where we have replaced by 1. The formula may be 
recast in terms of more convenient quantities, h is twice 
the areal velocity, hence in the approximation of a Kepler 
ellipse 


h = 




(60) 


T 


(64) 
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with T its orbital period, and from Kepler’s third law 


rp2 


GM ’ 


(65) 


we obtain an expression for GM. Combining the two 
results, we have {GM)"^jh? = 47r^a^/(T^(l — e^)) and 
find 


+ ■ ( 66 ) 

The annual perihelion shift AP^ is obtained from this by 
multiplying with Tearth/T', which is a factor of 4.152 for 
Mercury, having an orbital period of 87.969 d. Plugging 
in numbers, we get APq = 3.4730 x 10“'^ (ai — /32 + 4). 
This is the result in radians. To convert it to arcseconds, 
we note that 1" = 27r/360/3600rad = 4.8481 x 10“®rad. 
Then we have APa = 0.0716" x {a\ — /32 + 4). Experi¬ 
mental measurements give APa = 0.4298")^ from which 
we infer 


«! — ,02 + 4 


0.4298" 

0.0716" 


6.003. 


This immediately leads to the conjecture 


«! — /?2 = 2 . 


(67) 


( 68 ) 


Together with ai = 2{1 — (32) from Sec. IIIIl we end up 
with 


ai = 2 , /32 = 0 . (69) 

Therefore, we have now inferred the Schwarzschild met¬ 
ric to second-order accuracy in the small parame¬ 
ter GM/rc? for the coefficient gtt = f{r) and to first- 
order accuracy for grr = gi^), which is known as the first 
parameterized post-Newtonian approximation (PPN);^ 
Clearly, the two parameters determined in this section 
from experimental information would be known only with 
finite precision. They could not be claimed to be exact 
without the benefits of the field theory. 

We conclude that Einstein might indeed have used 
a similar approach in 1911 and would then have been 
able to correctly predict gravitational light deflection by 
the sun five years earlier than he actually did. Also, 
he might have found the Schwarzschild solution before 
Schwarzschild^ and Drostei^ 

As it turns out, experimental information on light de¬ 
flection gives much simpler access to the coefficient ai 
than perihelion precession data. So let us turn to a brief 
analysis of the behavior of light in the gravitational field 
of a spherically symmetric mass distribution. 

Unfortunately, this experimental informaton was not 
available before 1919 and then only with low accuracyi^ 


V. LIGHT DEFLECTION 


favorable, as it leads to simpler equations and allows one 
to repeat the argument about the factor of 2 between 
the EP prediction and the full calculation, given below. 
Given the equations for null geodesics, one may eliminate 
the affine parameter, determine the energy constant k 
from the limit r —^ oo and introduce the variable u = b/p, 
which produces Eq. (|79|). 

Assuming instead no prior knowledge of the geodesic 
equations, it may be argued that just as the principle of 
least action governs the motion of particles and gives us 
the equations of motion, once we know the Lagrangian, 
Eermat’s principle governs the paths of light and gives 
us their equations, once we know the index of refraction 
or, equivalently, the speed of light. Both principles have 
the advantage of being coordinate free, so we may expect 
them to work in curved spacetime without problems. It 
should not matter whether a given coordinate system de¬ 
scribes a patch of flat or of curved spacetime. 

Let us therefore put Eermat’s principle to use in de¬ 
scribing light deflection. It is obvious that the coordinate 
speed of light, obtained from the line element by setting 
ds^ = 0, varies in a metric such as (|26)) . We may inter¬ 
pret this in terms of a variable refractive index n, and 
then require, in order to calculate the path of light 

6Sf = 0 for Sf = — J din 

where dl is the (coordinate) length element of the path 
of the light ray and c{l) is the local speed of light. The 
endpoints of the path are supposed to be fixed. Hereafter, 
I will slightly abuse notation in taking c for the universal 
speed of light and c{x) for the coordinate speed of light 
at some point x. 

When written in the form (EOl), the principle requires 
n(x) and hence c{x) to be a scalar function, i.e., the ve¬ 
locity of light should be isotropic. This is clearly not true 
for the general metric (E5)) . The velocity of light in the ra¬ 
dial direction is given by Cr{r) = dr/dt = \/f(j) cj \J g{r) 
(setting ds, d'd and dg} equal to zero), whereas the trans¬ 
verse speed of light is c* (r) = ^f{r) c (setting ds and dr 
equal to zero). 

While it is possible to phrase Fermat’s principle for an¬ 
isotropic light propagation^^— the necessity to first de¬ 
rive this unfamiliar formulation makes it unattractive for 
work in class. Instead, we rewrite our metric to spatially 
isotropic form. This can be achieved via introduction of 
a new radial coordinate p as follows: set r = r[p) and 
require 

5(r)r'(p)2 = 4- (71) 

P 

Solving this differential equation for r gives a metric with 
spatial part g{r{p)) r'{p)^ (dp^ -I- p^dd^ + p^ sin^ d dip^). 
For the metric (iTTl) with a{r) approximated by the first 
term of dsni), this procedure leads to 



Again, if the geodesic equations are supposed known, 
part of this section can be skipped. Even then, the use 
of isotropic Schwarzschild coordinates suggested here is 


r = p 



aiM 

Ap 


2 


(72) 
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where the integration constant has been chosen so that 
far from the central mass both coordinates become equal 
to each other. In the new coordinates, the spacetime line 
element takes the form 


ds^ = —kf (? df' , (73) 

where 


dl^ = dp^ + p^dd^ + sin^ d dp^ , 



(74) 


The local coordinate speed of light is then given by 


c{p) 



(2 + ai)M\ 

2p J 



(75) 


For symmetry reasons, we expect the path of a light ray 
in the equatorial plane = 7r/2 to remain in that plane, 
so we can drop the dd contribution to the spatial coor¬ 
dinate line element dl. We are then left with the task to 
minimize 


, f ,/dp^ + p^dip^ 

—sw— =y c(,M) 

Taking the integrand to be a function SFip'i^p)^ pip’), <p), 
we note it does not depend on ip explicitly, so the “Hamil¬ 
tonian” 


idsF 


— Sf 


is constant. Naming the constant —bjc, we find 


(77) 


u c{p) 

Vp'" + c 


(78) 


Solving this algebraically for p' and substituting u = b/p, 
we obtain 


u^ = 


z[p(u))‘ 


= 1-b 


(2 + ai)Mu 



(79) 


Differentiating with respect to <p and dividing through 
by 2u', we end up with an extremely simple equation 


// , (2 + Q!i)M 

" +“ = —— 


(SO) 


This is a shifted harmonic oscillator. Rewritten in terms 
of p, the solution reads 


b 

P 


sin((p - (po) + 


(2-f ai)M 


(81) 



FIG. 2. Pictorial representation of the trajectory of a photon 
and its orientation in the coordinate system with x = pcos ip, 
y = psinip. Small angles are generously exaggerated in size. 


From this equation, describing a hyperbola, we can read 
off the deflection angle. For convenience, we set (po = 0, 
which gives the hyperbola the orientation shown in Fig.[^ 
(assuming 6 > 0). 

For p —>■ oo, given the smallness of the second term 
on the right-hand side of Eq. dUD, the argument of the 
sine must go to (pi = —(1 -I- ai/2)M/5 for x > 0 and to 
(p 2 = Ti" + (1 + q;i/ 2)M/6 for X < 0. The deflection angle 
then is 

^ ai \ 2GM 

S^^,-F-ip,= (l + -)^. (82) 

With ai = 2, Einstein’s 1916 result S = dGM/{bc^) is 
recovered, corresponding to an angle of 1.75” for a light 
ray that grazes the surface of the sun (i.e., when b is equal 
to the radius of the sun). 

The utility of isotropic coordinates in the calculation 
of this effect may be underlined by a comment regard¬ 
ing the factor of 2 between the correct (first-order) result 
and Einstein’s original result. In his 1911 paper, Einstein 
derived the speed of light from the equivalence principle 
alone, which means that he replaced the local gravita¬ 
tional field by a patch of an inertial field with a value gtt 
adapted to the true field, but effectively with = 1- 
He therefore obtained 



(83) 


Plugging this velocity into Fermat’s principle, we get a 
formula in which the deviation of the coordinate speed 
of light from the vacuum speed of light is formally half 
the value of the deviation in the isotropic Schwarzschild 
metric, if we rename r to p (compare with Eq. (1751) for 
tti =2). Since the angular deflection is so small that it 
is easily captured by perturbation theory, the final result 
must be linear in this deviation (from the zeroth-order 
straight-line solution), so the angle of deflection obtained 
in the full theory must be a factor of 2 larger than the 
one obtained by the EP only. No argument of compara¬ 
ble simplicity is available when comparing a calculation 
based on the EP with the full theory using the original 
Schwarzschild metric. In that metric, the speed of light 
is anisotropic, agreeing with the prediction from the EP 
for transverse light rays and differing from it for radial 
ones. (Far from the sun, the ray is essentially radial.) 

Note that if we assume the experimental result on light 
deflection to be available, we can deduce the value of cxi 


2b 
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without the need to require that (1391) is valid beyond 
the lowest order in M. Hence the experiment allows us 
to determine this coefficient without any second-order 
knowledge of /(r), whereas we need this information, if 
we want to obtain ai from the perihelion precession. How 
does this come about? 

Consider the complete Lagrangian of Eq. (ICT) . In it, 
the term oc is much smaller than the term oc be¬ 
cause the velocity r of a planet is much smaller than c. 
So in order to obtain an accurate estimate of the small 
quantity a\M jr appearing as a factor in the former term, 
we have to know much smaller factors of the latter term, 
i.e., we have to calculate /(r) to second-order precision. 
The case of light bending is different. Here the dr^ and 
dt^ terms of the line element are the same order of mag¬ 
nitude, because for light dr/dt is on the order of c. Thus, 
to determine grr accurate to first order from experiment, 
it is sufficient to know gtt to hrst order. 

VI. CONCLUSIONS 

It is well-known that out of the three classical tests 
of general relativity, the gravitational redshift, essen¬ 
tially explicable in terms of the EP, does not probe the 
field equations, whereas light deflection in a gravitational 
field and the perihelion precession do depend on them. 
The same is true for the fourth test conceived later, the 
Shapiro delay, not discussed in detail here. 

We have explored in this paper, how far some sim¬ 
ple ideas, not exploiting the full field theory, may carry 
us in determining a usable weak-field approximation of 
the metric outside a spherically symmetric mass distribu¬ 
tion. This amounts to the approximate construction of no 
more than two radial functions. As it was not expected 
that these ideas would generate enough information to 
predict spacetime curvature quantitatively, we were will¬ 
ing to accept one adjustable parameter to emerge from 
one of the three experiments probing the field equations, 
which would then, hopefully, allow us to make quanti¬ 
tative predictions of the other two. In part, this was 
motivated by the increase in credibility that a “physics 
first” approach to GR would gain, if the metric employed 
to derive predictions could be justified without use of the 
field equations. 


It turned out that this program is feasible but that 
the two experiments considered need different levels of 
additional information. 

In the case of the perihelion precession, to progress 
at all we had to make an assumption about the range 
of validity of the force law (15^ . because two unknown 
coefficients of the metric are needed in a lowest-order 
PPN description. Given that new assumption and a mea¬ 
surement of the perihelion precession, we get the met¬ 
ric with sufficient accuracy to predict both the outcome 
of the light bending experiment and the Shapiro effect 
quantitatively.— 

In the case of light bending by the sun, only one pa¬ 
rameter is missing in the metric as far as it is determined 
by the EP and the NL. A measurement of light deflec¬ 
tion fixes this parameter. With the metric so obtained, 
the Shapiro delay could be predicted quantitatively with¬ 
out any additional postulate on the force law. However, 
the same metric would be insufficient to quantitatively 
predict the perihelion precession of Mercury. Assuming 
the force law to be accurate beyond lowest order, enough 
information can be gathered. 

In regard to research, our results may be considered 
not overly interesting, since the exact solution of the field 
equations is known. But they might be of some use in 
the classroom. One of the calculations from Secs. m 
and El can be discussed during a course, the other may 
be assigned as a homework. Physical understanding of 
the spherically symmetric system would be greatly im¬ 
proved. The experience that our attempt at a simplifi¬ 
cation, avoiding the field equations, involves moderately 
extensive calculations at least in the case of the perihelion 
precession, will help students to appreciate the introduc¬ 
tion of the field equations later, which allow, with a little 
more effort, to obtain the exact form of the spherically 
symmetric static metric. 
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